Investigating the Security Threat Arising from “Yes-No” Implicit Bias in Large Language Models
Published in AAAI 2025, 2024
Recommended citation: Sendong Zhao, Du et al. (2025). "Investigating the Security Threat Arising from “Yes-No” Implicit Bias in Large Language Models; AAAI 2025.
Download Paper